A Supra - Classi er Architecture for Scalable
نویسندگان
چکیده
When faced with inadequate information , humans often use knowledge gained from previous experience to help them in making decisions. Even when this knowledge is spread thinly among many previous experiences, humans are able to eeectively accumulate and apply it to a current classiica-tion task of interest. Inspired by human knowledge reuse, we have previously introduced a general framework for the use of knowledge embodied in existing classi-ers to aid in a new classiication task. In this framework, a supra-classiier is built to make decisions based on the outputs of large numbers of previously trained clas-siiers designed for diierent, but possibly relevant tasks. In this article, we discuss the Hamming Nearest Neighbor (HNN) supra-classiier architecture and mathematically show its usefulness. Experiments on public domain data sets demonstrate the practicality of the framework and HNN supra-classiier when faced with very few training examples.
منابع مشابه
Eective supra-classi®ers for knowledge base construction
We explore the use of the supra-classi®er framework in the construction of a classi®er knowledge base. Previously, we introduced this framework within which labels produced by old classi®ers are used to improve the generalization performance of a new classi®er for a dierent but related classi®cation task (Bollacker and Ghosh, 1998). We showed empirically that a simple Hamming nearest neighbor ...
متن کاملSLIQ: A Fast Scalable Classi er for Data Mining
Classi cation is an important problem in the emerging eld of data mining. Although classi cation has been studied extensively in the past, most of the classi cation algorithms are designed only for memory-resident data, thus limiting their suitability for data mining large data sets. This paper discusses issues in building a scalable classier and presents the design of SLIQ, a new classi er. SL...
متن کاملOn The Design of Supra - Classi ers for
| We have recently introduced a framework for the reuse of knowledge from previously trained classiiers to improve performance in a current, possibly related classi-cation task. This framework requires the use of a supra-classiier, which makes a classiication decision based on the outputs of a large number of previously trained diverse classiiers. We discuss the performance requirements of a go...
متن کاملDynamic configuration and collaborative scheduling in supply chains based on scalable multi-agent architecture
Due to diversified and frequently changing demands from customers, technological advances and global competition, manufacturers rely on collaboration with their business partners to share costs, risks and expertise. How to take advantage of advancement of technologies to effectively support operations and create competitive advantage is critical for manufacturers to survive. To respond to these...
متن کاملSPRINT: A Scalable Parallel Classi er for Data Mining
Classi cation is an important data mining problem. Although classi cation is a wellstudied problem, most of the current classication algorithms require that all or a portion of the the entire dataset remain permanently in memory. This limits their suitability for mining over large databases. We present a new decision-tree-based classi cation algorithm, called SPRINT that removes all of the memo...
متن کامل